Appendix of "Hierarchical Vector Quantized Transformer for Multi-class Unsupervised Anomaly Detection"

Neural Information Processing Systems

The hyperparameters β and α are set to 0.5 and 0.01 for each layer. CIFAR-10: the image size is set to 224 x 224, and the feature size is 14 x 14. The encoder and decoder layers are both set to 4, and the hyperparameters β and α are again set to 0.5 and 0.01 for each layer. The ELBO of our variational autoencoder should include both a reconstruction likelihood and a KL term. Since the KL divergence term of the Evidence Lower Bound (ELBO) is constant, it can be ignored during training.
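The point about the constant KL term can be illustrated with a minimal NumPy sketch (not the paper's actual loss): when the KL divergence does not depend on the model parameters, maximizing the ELBO reduces to maximizing the reconstruction likelihood alone. The function names and the Gaussian (squared-error) likelihood here are illustrative assumptions.

```python
import numpy as np

def reconstruction_ll(x, x_hat):
    # Negative squared error, i.e. a Gaussian log-likelihood up to a constant.
    return -np.sum((x - x_hat) ** 2)

def elbo(x, x_hat, kl):
    # ELBO = reconstruction log-likelihood - KL divergence.
    return reconstruction_ll(x, x_hat) - kl

x = np.array([1.0, 2.0, 3.0])
x_hat = np.array([1.1, 1.9, 3.0])
const_kl = 0.5
# With a constant KL term, the gradient of the ELBO w.r.t. the model
# parameters equals the gradient of the reconstruction term alone,
# so the KL term can be dropped from the training objective.
print(elbo(x, x_hat, const_kl))
```

Only the reconstruction term then needs to be optimized during training.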



Another BRIXEL in the Wall: Towards Cheaper Dense Features

Lappe, Alexander, Giese, Martin A.

arXiv.org Artificial Intelligence

Vision foundation models achieve strong performance on both global and locally dense downstream tasks. Pretrained on large images, the recent DINOv3 model family is able to produce very fine-grained dense feature maps, enabling state-of-the-art performance. However, computing these feature maps requires the input image to be available at very high resolution, as well as large amounts of compute due to the squared complexity of the transformer architecture. To address these issues, we propose BRIXEL, a simple knowledge distillation approach that has the student learn to reproduce its own feature maps at higher resolution. Despite its simplicity, BRIXEL outperforms the baseline DINOv3 models by large margins on downstream tasks when the resolution is kept fixed. Moreover, it is able to produce feature maps that are very similar to those of the teacher at a fraction of the computational cost.
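The distillation idea of matching a student's low-resolution feature maps to a higher-resolution target can be sketched in NumPy (this is a generic feature-distillation loss under assumed shapes, not BRIXEL's actual training objective):

```python
import numpy as np

def upsample(feats, factor):
    # Nearest-neighbor upsampling of a (C, H, W) feature map.
    return feats.repeat(factor, axis=1).repeat(factor, axis=2)

def distill_loss(student_lowres, teacher_highres, factor):
    # Mean squared error between the upsampled student features
    # and the teacher's high-resolution feature map.
    up = upsample(student_lowres, factor)
    return float(np.mean((up - teacher_highres) ** 2))

rng = np.random.default_rng(0)
student = rng.standard_normal((8, 7, 7))   # low-res student features
teacher = upsample(student, 2)             # perfectly matched target
print(distill_loss(student, teacher, 2))   # → 0.0
```

In practice the student would run on a cheaper, lower-resolution input while the teacher's feature map comes from the expensive high-resolution forward pass.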




Improve bounding box in Carla Simulator

Chaar, Mohamad Mofeed, Raiyn, Jamal, Weidl, Galia

arXiv.org Artificial Intelligence

The CARLA simulator (Car Learning to Act) serves as a robust platform for testing algorithms and generating datasets in the field of Autonomous Driving (AD). It provides control over various environmental parameters, enabling thorough evaluation. Bounding boxes are commonly utilized tools in deep learning development and play a crucial role in AD applications. The predominant method for data generation in the CARLA simulator involves identifying and delineating objects of interest, such as vehicles, using bounding boxes. The operation in CARLA entails capturing the coordinates of all objects on the map, which are subsequently aligned with the sensor's coordinate system at the ego vehicle and then enclosed within bounding boxes relative to the ego vehicle's perspective. However, this primary approach encounters challenges with object detection and bounding box annotation, such as ghost boxes. Although these procedures are generally effective at detecting vehicles and other objects within their direct line of sight, they may also produce false positives by identifying objects that are obscured by obstructions. We have enhanced the primary approach with the objective of filtering out such unwanted boxes. Performance analysis indicates that the improved approach achieves high accuracy.
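One common way to filter ghost boxes of this kind is a depth consistency check: if a depth sensor measures a much smaller distance at the box centre than the annotated object's distance, something occludes the object and the box is discarded. The sketch below illustrates that idea with a hypothetical box format and tolerance; it is not the paper's actual filtering method.

```python
import numpy as np

def filter_ghost_boxes(boxes, depth_map, tol=1.0):
    """Keep boxes whose annotated distance agrees with the measured depth.

    boxes: list of (cx, cy, distance) tuples, where (cx, cy) is the pixel
    centre of the box and distance is the object's distance from the ego
    vehicle (hypothetical fields for illustration).
    depth_map: (H, W) array of per-pixel depth from a depth camera.
    """
    kept = []
    for cx, cy, dist in boxes:
        measured = depth_map[cy, cx]
        # If the depth camera sees something far closer than the object,
        # the object is occluded and its box is a ghost box.
        if measured + tol >= dist:
            kept.append((cx, cy, dist))
    return kept

depth = np.full((4, 4), 10.0)
depth[1, 1] = 3.0  # an obstruction in front of pixel (1, 1)
boxes = [(1, 1, 9.0),   # vehicle hidden behind the obstruction
         (2, 2, 9.5)]   # visible vehicle
print(filter_ghost_boxes(boxes, depth))  # → [(2, 2, 9.5)]
```

A single centre-pixel lookup is the simplest variant; a more robust check would compare depth statistics over the whole box region.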




Performance comparison of medical image classification systems using TensorFlow Keras, PyTorch, and JAX

Bećirović, Merjem, Kurtović, Amina, Smajlović, Nordin, Kapo, Medina, Akagić, Amila

arXiv.org Artificial Intelligence

Medical imaging plays a vital role in early disease diagnosis and monitoring. Specifically, blood microscopy offers valuable insights into blood cell morphology and the detection of hematological disorders. In recent years, deep learning-based automated classification systems have demonstrated high potential in enhancing the accuracy and efficiency of blood image analysis. However, a detailed performance analysis of specific deep learning frameworks appears to be lacking. This paper compares the performance of three popular deep learning frameworks, TensorFlow with Keras, PyTorch, and JAX, in classifying blood cell images from the publicly available BloodMNIST dataset. The study primarily focuses on inference time differences, but also classification performance for different image sizes. The results reveal variations in performance across frameworks, influenced by factors such as image resolution and framework-specific optimizations. Classification accuracy for JAX and PyTorch was comparable to current benchmarks, showcasing the efficiency of these frameworks for medical image classification.
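Inference-time comparisons of this kind hinge on careful timing methodology. A minimal, framework-agnostic sketch (using a NumPy stand-in model rather than TensorFlow, PyTorch, or JAX) looks like this; the warm-up runs are important because frameworks with JIT compilation, such as JAX, pay a large one-off cost on the first call:

```python
import time
import numpy as np

def time_inference(model, x, n_runs=10, warmup=2):
    # Warm-up excludes one-off costs (JIT compilation, cache fills).
    for _ in range(warmup):
        model(x)
    start = time.perf_counter()
    for _ in range(n_runs):
        model(x)
    # Mean wall-clock time per forward pass, in seconds.
    return (time.perf_counter() - start) / n_runs

# Stand-in "model": a single dense layer over a batch of 28x28 images.
w = np.random.default_rng(0).standard_normal((28 * 28, 8))
model = lambda x: x.reshape(len(x), -1) @ w
batch = np.zeros((32, 28, 28))
avg = time_inference(model, batch)
print(f"mean inference time: {avg * 1e3:.3f} ms")
```

The same harness can wrap each framework's predict function so that all three are measured under identical batch sizes and image resolutions.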